Extracting Social Power Relationships from Natural Language
نویسندگان
چکیده
Sociolinguists have long argued that social context influences language use in all manner of ways, resulting in lects 1 . This paper explores a text classification problem we will call lect modeling, an example of what has been termed computational sociolinguistics. In particular, we use machine learning techniques to identify social power relationships between members of a social network, based purely on the content of their interpersonal communication. We rely on statistical methods, as opposed to language-specific engineering, to extract features which represent vocabulary and grammar usage indicative of social power lect. We then apply support vector machines to model the social power lects representing superior-subordinate communication in the Enron email corpus. Our results validate the treatment of lect modeling as a text classification problem – albeit a hard one – and constitute a case for future research in computational sociolinguistics.
منابع مشابه
Potential Power and Problems in Sentiment Mining of Social Media
Sentiment mining (SM), also called opinion mining or sentiment analysis, has evolved over the last decade from text mining and natural language processing, but aims to determine the attitudes of individuals/groups with respect to some specific topics. More recently, SM has greatly assisted decision makers in extracting opinions from unstructured human-authored documents. SM is a computational p...
متن کاملSubsequence Kernels for Relation Extraction
We present a new kernel method for extracting semantic relations between entities in natural language text, based on a generalization of subsequence kernels. This kernel uses three types of subsequence patterns that are typically employed in natural language to assert relationships between two entities. Experiments on extracting protein interactions from biomedical corpora and top-level relatio...
متن کاملKnowledge Acquisition with Natural Language Processing in the Food Domain: Potential and Challenges
In this paper, we present an outlook on the effectiveness of natural language processing (NLP) in extracting knowledge for the food domain. We identify potential scenarios that we think are particularly suitable for NLP techniques. As a source for extracting knowledge we will highlight the benefits of textual content from social media. Typical methods that we think would be suitable will be dis...
متن کاملDomain Knowledge Extracting in a Chinese Natural Language Interface to Databases: NChiql
This paper presents the method of domain knowledge extracting in NChiql, a Chinese natural language interface to databases. After describing the overall extracting strategy in NChiql, we mainly discuss the basic semantic information extracting method, called DSE. A semantic conceptual graph is employed to specify two types of modification and three types of verbal relationship among the entitie...
متن کاملExtraction of protein interaction information from unstructured text using a context-free grammar
MOTIVATION As research into disease pathology and cellular function continues to generate vast amounts of data pertaining to protein, gene and small molecule (PGSM) interactions, there exists a critical need to capture these results in structured formats allowing for computational analysis. Although many efforts have been made to create databases that store this information in computer readable...
متن کامل